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Abstract. Recently, cosmic shear, the weak lensing effect by the inhomogeneous matter distribution in the Universe, 
has not only been detected by several groups, but the observational results have been used to derive constraints 
on cosmological parameters. For this purpose, several cosmic shear statistics have been employed. As shown 
recently, all two-point statistical measures can be expressed in terms of the two-point correlation functions of the 
shear, which thus represents the basic quantity; also, from a practical point-of-view, the two-point correlation 
functions are easiest to obtain from observational data which typically have complicated geometry. We derive 
in this paper expressions for the covariance matrix of the cosmic shear two-point correlation functions which 
are readily applied to any survey geometry. Furthermore, we consider the more special case of a simple survey 
geometry which allows us to obtain approximations for the covariance matrix in terms of integrals which are 
readily evaluated numerically. These results are then used to study the covariance of the aperture mass dispersion 
which has been employed earlier in quantitative cosmic shear analyses. We show that the aperture mass dispersion, 
measured at two different angular scales, quickly decorrelates with the ratio of the scales. Inverting the relation 
between the shear two-point correlation functions and the power spectrum of the underlying projected matter 
distribution, we construct estimators for the power spectrum and for the band powers, and show that they yields 
accurate approximations; in particular, the correlation between band powers at different wave numbers is quite 
weak. The covariance matrix of the shear correlation function is then used to investigate the expected accuracy of 
cosmological parameter estimates from cosmic shear surveys. Depending on the use of prior information, e.g. from 
CMB measurements, cosmic shear can yield very accurate determinations of several cosmological parameters, in 
particular the normalization erg of the power spectrum of the matter distribution, the matter density parameter 
S7 m , and the shape parameter V. 

Key words, cosmology - gravitational lensing - large-scale structure of the Universe 
1. Introduction 

Cosmic shear, the distortion of the images of distant galaxies by the tidal gravitational held of intervening matter 
inhomogeneities, offers a direct way of probing the statistical properties of the large-scale (dark) matter distribution 
in the Universe, without making any assumption on the relation between dark and luminous matter (e.g., Blandford 
et al. 1991, Miralda-Escude 1991, Kaiser 1992, 1998, Jain & Seljak 1997, Bernardeau et al. 1997, Schneider et al. 1998, 
hereafter SvWJK, van Waerbeke et al. 1999; Bartelmann & Schneider 1999; Jain et al. 2000, White & Hu 2000; see 
Mellier 1999 and Bartelmann & Schneider 2001 for recent reviews). The first detections of cosmic shear on wide-field 
imaging data (Bacon et al. 2000, 2002; Kaiser et al. 2000; van Waerbeke et al. 2000, 2001, 2002; Wittman et al. 
2000, Maoli et al. 2001; Rhodes et al. 2001; Hammerle et al. 2002; Hoekstra et al. 2002; Refregier et al. 2002) has 
demonstrated the feasibility of this new window of observational cosmology, and yielded already the first constraints 
on cosmological parameters, most noticibly the normalization ag of the dark matter power spectrum, but also on the 
matter density parameter fl m (van Waerbeke et al. 2002; Hoekstra et al. 2002). 
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Most analytical work on cosmic shear has been done on two-point statistical measures of the distortion field, such 
as the shear correlation functions, the shear variance in an apertures, or the aperture mass (see Sect. 2 for a definition 
of these quantitities). Although higher-order statistical measures, such as the skewness of the shear (Bernardeau et al. 
1997), are likely to yield additional, if not even superior constraints on cosmological parameters, their theoretical pre- 
dictions are more uncertain at present. Recently, an estimator for the skewness of the shear was developed (Bernardeau 
et al. 2002a), and applied to wide-field survey data (Bernardeau et al 2002b), yielding a significant detection. 

In this paper we consider second-order statistical measures only. All of them can be derived in terms of the 
correlation functions, as shown in, e.g., Crittenden et al. (2002, hereafter C02) and Schneider et al. (2002, hereafter 
SvWM), and since the measurement of the correlation functions is easier in practice than the other two-point statistics 
(e.g., gaps in the data are easily dealt with), we consider the correlation functions as the fundamental obscrvables 
from a cosmic shear survey. In order to use them for determining cosmological parameters, it is important to know a 
practical and unbiased estimator for them, and to determine the covariance of this estimator. Two effects enter this 
covariance: a random part, which is due to the intrinsic ellipticity of the galaxies from which the shear is measured, 
together with measurement noise, and sampling (or cosmic) variance. The first of these effects is expected to dominate 
on small angular scales, whereas the second determines the covariance for large separations. The covariance will depend 
not only on the total survey area, but also on the survey geometry. As has been pointed out by Kaiser (1998), in order 
to decrease the sampling variance on large scales, it may be favourable to choose a survey geometry that samples those 
scales sparsely. In order to design an optimized survey geometry, the covariance as a function of survey geometry needs 
to be calculated. 

Here, we calculate the covariance matrices for the shear correlation functions binned in angular separation. In 
Sect. 2, we introduce our notation and briefly summarize the two-point cosmic shear measures and their interrelations. 
Unbiased estimators of the two basic correlation functions are derived in Sect. 3, togthcr with the corresponding 
unbiased estimators of the aperture mass and the shear dispersion. The covariance matrices of these correlation 
function estimators are then derived in Sect. 4, expressed in terms of a set of galaxy positions. From these expressions, 
the covariances can be determined for an arbitrary survey geometry. In a forthcoming paper (Kilbinger et al., in 
preparation), we shall use the results of Sect. 4 to design an optimized geometry for a planned cosmic shear survey. 

For the case of a filled survey geometry, the ensemble average of these covariances can be further analyzed; using 
a few approximations, we express in Sect. 5 the covariances for this case in terms of integrals. The corresponding 
expressions have been evaluated, for a particular cosmological model, and are illustrated in a set of figures. In Sect. 6 
we derive the covariance for the aperture mass dispersion, which can be expressed simply in terms of the covariances 
of the correlation functions. The variance of the aperture mass dispersion, as well as the covariance, is then explicitly 
calculated for a survey with filled geometry, showing that indeed the aperture mass at two angular scales decorrelates 
quickly as the scale ratio decreases away from unity. 

We then turn in Sect. 7 to a simple estimator of the power spectrum of the projected cosmic density field, which 
can be expresed in terms of the correlation functions. Since the correlation functions will be known only over a finite 
range in angular separation, the simple estimator we derive is biased. We show that, provided the angular range on 
which the correlation functions can be measured is as large as can be expected with the next generation of cosmic 
shear surveys, this bias is indeed very small over a large range of wave numbers. We derive the covariance of the 
power spectrum estimator and calculate it explicitly for the filled survey geometry case; the resulting error bars on the 
estimated power spectrum are quite a bit smaller than one might have expected, given the simplicity of the approach. 
In Sect. 8 we consider the accuracy with which the parameters of the cosmological model can be constrained, given 
a survey area. In fact, by fitting the correlation function directly to model predictions, even the currently available 
cosmic shear surveys can yield fairly accurate constraints on cosmological parameter. Finally, we summarize our results 
in Sect. 9. 

In this paper we shall assume that the observable shear is due to the tidal gravitational field of the cosmological 
matter distribution only; in this case, the two shear components are not mutually independent. This is due to the fact 
that the gravitational field is a gradient field. In the language of some recent papers (e.g., C02; Pen et al. 2002; SvWM), 
we thus assume that the shear field is a pure E-mode field. B-modes, (or the 'curl component'), can in principle be 
generated if the intrinsic orientation of the galaxies from which the shear is measured are correlated, e.g. due to tidal 
interactions of dark matter halos in which these galaxies are formed (Croft & Metzler 2000; Pen et al. 2000; Heavens 
et al. 2001; Catelan et al. 2001; Mackey et al. 2002; Brown et al. 2002). Also, the clustering of source galaxies in 
rcdshift space generates a B-mode contribution which, however, turns out to be fairly small (SvWM). This restriction 
to E-modes only affects the interrelations between various two-point statistics. Inclusion of B-modes would not change 
the results of Sects. 3 through 5, and much of Sects. 6 and 7 will also be left unaffected in the presence of a B-mode 
contribution; we shall indicate this in due course. 
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2. Two-point measures of cosmic shear 

We follow here the notation of Bartelmann & Schneider (2001). The power spectrum of the projected density field is 
denoted by P K (£), where i is the Fourier variable on the sky. 

The shear correlation functions are defined by considering pairs of positions i9 and and defining the tangential 

and cross-component of the shear 7 = 71 + i72 at position i9 for this pair as 

7 t = -^c( 7 e- 2 ^) ; 7x =-Im(7C- 2i ") , (1) 

where tp is the polar angle of the separation vector 6. Then we define the two shear correlation functions (e.g., Kaiser 
1992) 

e±(0) := (7t7t) ± (7x7x) = ^ I™ deep K (£)j 0A (ee) , (2) 



where the Bessel function Jo (J4) corresponds to the '+' ('— ') correlation function. The shear dispersion in a circle of 
radius 9 is defined by considering circular apertures and ensemble-averaging over the square of the complex shear; its 
relation to the power spectrum reads (e.g., Kaiser 1992) 

Furthermore, the aperture mass M ap in an aperture of radius 9 is defined as a weighted average over the tangential 
shear component (see, e.g. Schneider 1996; SvwJK), and its dispersion is related to the power spectrum by 

1 r „c 0^576 [nm? 



where the same weight function in the definition of M ap as in SvWM was assumed. 

All these 2-point statistics are thus linearly filtered versions of the power spectrum P R , where the filter functions 
are quite different between the various statistics. For the correlation function £+, the filter function 3q(ji) is very broad, 
about constant for 77 1 , and oscillating for large r/, with an amplitude decreasing as rj^ 1 ' 2 . The filter function J 4(77) 
for has the same slow decrease, but behaves as rf for small rj, and is therefore more localized than the one for £ + . 
The filter function for the shear dispersion, [2 Ji (77) /r/] 2 , is a low-pass filter, i.e. constant for 77 1 , and then decreasing 
in amplitude as ry -3 for large r\. Finally, the filter function for the aperture mass dispersion is [2A3 4(1]) / rj 2 ] 2 and thus 
behaves like rf for small 77, and decreases oscillatory as r/~ 5 for 77 — > 00. Hence, (M ap ) yields the most local estimate 
of the underlying power spectrum of the projected mass. On the other hand, because it is so localized, it contains less 
power in its filter, so that the value of (M 2 p ) is smaller than that of (\l\ 2 ^ 011 the same filter scale 9. 

The various two-point statistics of the shear are related to each other; in particular, they can all be expressed 
in terms of the correlation functions, as was shown in C02, Pen et al. (2002) and SvWM. We briefly summarize the 
results here. 

Making use of the orthogonality of Bessel functions, the power spectrum can be expressed in terms of the correlation 
functions £+(9) and £-(9), by multiplying eqs.(|^) by 9Jq(£9) and 9 3a(£9), respectively, and then integrating over 9, 
to obtain 

/•OO P 00 

P K (£) = 2ir d9 9 £ + (9) J (t9) = 2?r / dO 0f_(0) h(£9) . (5) 
Jo Jo 

These equations express the power spectrum directly in terms of the observable correlation function; however, in order 
to evaluate P K (i) from them, one would need to know the correlation functions for all angles. In Sect. 7 below, we 
shall investigate how well the power spectrum can be determined from knowing the correlation function over a limited 
range of separations. 

The two equations (|J) and (|5|) allow us to express in terms of and reversely (see SvWM for a derivation), 
p8 / 4 1 9i9 2 \ r°° / 4 1 9fl 2 \ 

e_(0) = e+(<o+ y o d#n+&)[ ¥ --g r ) ; J ^u^--— j . (6) 

Hence one can obtain £-(9) from the correlation function in the interval < •& < 9, and so this relation can be 

directly applied to observational data, with a minor extrapolation to small separations. The reverse relation, expressing 
in terms of is less useful in practice, owing to the infinite range of integration. 
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Next we express the shear dispersion (||) in terms of the correlation function, by inserting (||) into (^); this yields 
(van Waerbeke 2000; SvWM) 



IT) 



where 



S+(x) = 4 / — J {xt) [Ji(t)] 2 = - 4arccos(^ ) - zV4 - x 2 H(2 - x) , 

J t TT I \2J J 



'0 

, f °° dt T , WT , Sl2 xV4 - x 2 (6 - x 2 ) - 8(3 - x 2 ) arcsin(a;/2) TT ,„ . 4(x 2 - 3) TT , 
S_(a;) = 4 / — J 4 (arf) [Ji(t)] 2 = — ^ >—^ A y -±^E(2~x)+ 1 4 ; H(x - 2) , (8) 



o 



t 



7T2T 



.7."* 



and H(x) is the Heaviside step function. Hence, the function S+(x) vanishes for x > 2, so that the shear dispersion 
can be expressed as a finite-range integral over the correlation function £ + ; this is not the case for S— . 

Similarly, one can express the aperture mass dispersion in terms of the correlation functions, by inserting (j^) into 



dip ip . . (<p\ I dtp p 



where 

poo ii 

T+(x) = 576 / ^Jo(xi) [3 4 (t)} 2 



e 2 



e-(,)T_(|) 



(9) 



6(2- 15a; 2 ) 



i 2 • ( x 
1 arcsm — 

TT V2 



xV 4 — a; 2 



T_(a;) = 576 



^J 4 (xi) [J 4 (i)] 2 



192 
35^ : 



lOOvr 

_ x 2 
4 



'120 + 2320x 2 - 754x 4 + 132x 6 - 9x 8 ) } H(2 - x) 



(10) 



7/2 



H(2-2!) 



Both of these functions vanish for x > 2, so that (Af 2 p ) can be expressed by a finite integral over either £±. The 
functions S± and T± are plotted in SvWM. 

The fact that we can express the shear dispersion and the aperture mass dispersion directly in terms of the 
correlation function over a finite interval is expected, given that the estimators of both statistics include products 
of cllipticities of pairs of objects separated by no more than the diameter of the aperture. However, what could not 
have been guessed a priori is that (M 2 p } (9) can be expressed by a finite integral over £ + and £_ separately. The 
determination of these statistics in terms of the correlation function is in practice easier than laying down apertures, 
owing to the holes and gaps in a data set; in addition, a comparison of the directly determined shear and aperture 
mass dispersion with those obtained from the correlation functions yields a useful check on the integrity of the data. 



3. Estimators 

We shall now consider practical estimators of the correlation functions and the other two-point statistics. The observ- 
able ellipticity ti of a galaxy image at angular position 0; is related to the intrinsic ellipticity e\ and the shear 7(6,) 

by 

ei = eS+7(*i), (11) 

where it has been assumed that I7I -C 1 for this weak lensing relation to be valid. In addition to an ellipticity, each 
galaxy may carry a weight factor wi which reflects the precision with which its ellipticity can be determined - noisy 
objects can then be downweighted by assigning small values of Wi to them (Hockstra et al. 2000; Erbcn et al. 2001; 
Bacon et al. 2001; Pen et al. 2002). We shall assume that the correlation function is to be estimated in bins of angular 
width A$, and define the function A^(<^>) = 1 for $ — A'ff/2 < <f> < $ + A$/2, and zero otherwise; hence, A#(0) defines 
the bin at angle ■&. An estimator for the correlation function is then 

y"L-„- W4 (eitCAt + ei X e,- x ) A#(\0i — OA) . , 

(i?) = ij jv p( y — ' NpW =^ wiw > A " m 6ji) ; (i2) 



A7p(i?) is the effective 'number of pairs' in the bin considered (in fact, if all weights are unity, N p is twice the number 
of pairs) , and the tangential and cross components of the ellipticity are defined in analogy to the corresponding shear 
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components in (|l|). The expectation value of this estimator is obtained by averaging over the source ellipticities, 
assumed to be randomly oriented, and an ensemble average of the shear field. Since 

{eitejt + e lx e JX ) = a^Sij + - dj\) , (13) 

where of is the dispersion of the intrinsic galaxy ellipticity, we see immediately that £ + is an unbiased estimator of 



e+W)=e+W, (14) 
since the product 5ijA§(\0i — 9j\) vanishes for all pairs i, j. Analogously, an unbiased estimator for £_ is 

._ _ E , j ^ 1 ^ lt ^, ) A,, m - 6 , f) ^ _ (i5) 

Next we obtain an unbiased estimator for the aperture mass dispersion. For that we assume that the centers of the 
bins on which the correlation function is calculated are described by 0j = (i — 1/2) A#, and that the aperture radius 
9 is an integer multiple of the bin width, 9 = m Aft, Then, the integrals in (^]) are replaced by sums over the bins, 
yielding an estimator for (Mf p (0)), 



A ,9 2m 



K + t + Wi)T + [%)+(!- K + )LmT- ( ^ 



(16) 



where K + describes the relative contributions of the two expressions (|^). For example, in the presence of B-modes, 
one needs to use K + = 1/2 (SvWM). Similarity, an unbiased estimator for the shear dispersion is 



2m /,o.\ 00 /,q. 



(17) 



again, in the presence of B-modes, K + = 1/2 shall be chosen (however, the infinite support of S- requires the knowledge 
of the correlation function £_ for all separations). Both of the above are unbiased estimators, (A4(9)) = (Mf p (#)), 

(S(9)) = (\ 1 \ 2 )(9). 



4. Covariance of the estimators 

Next we calculate the covariance of the various estimators, starting with the correlation functions. Hence, we define 
Cov(|±,i? i; | ± ,tf 2 ) := ((|±(0i)-£ ± (0i)) (£±(02) -£±(02))) • (18) 
Consider first the '++ '-covariance function, for which one needs to evaluate 

(l+(0i)l+(02)) = TTTTT^FTTtII WiWjW k wiA^{ij)A^{kl) ((eaeji + e i2 e j2 ) (e fc ie a + e k2 e l2 )) , (19) 

where wc defined A#(ij) = A$(\di — 6j\) and used the fact that eaCjt + £iy.ejx = eu£ji + ^a^ji- Next, the four-point 
correlation function of the ellipticities needs to be evaluated. For that, we use ( pi] ) and expand the resulting expression. 
Only terms of even power in e s and 7 survive the averaging over the source ellipticities and the ensemble average. 
Then, 



> = (lie, 

+ ( e L e i/3 e fe/j £ L) > (20) 



valid for i ^= j and k 7^ I, as needed in (|19[); here, Greek letters are € {1, 2}. To evaluate the four-point function of the 
shear, we shall assume that the shear field is Gaussian, so that the four-point function can be written as a sum over 
products of two-point functions. We shall later comment on the effect this assuption has on the determination of the 
covariances. The four-point function of the intrinsic ellipticity also factorizes, since at most two of the indices i,j,k,l 
are equal. Therefore, 
2 

{^ia e j/3 e k^lu) = -£-($jl&Py (lialkp.) + SjkSfJfi ("fia'flu) + SilS av (jjfj"fkp.) + <5jfc<W {ijpllv) 

r 2N 2 



+ (lialjp) {ikullv) + (7ia7fcA») (liPllv) + (licfllv) (7j'/37*m) + ( "jf ) (^^"W^ + SuSjkSauSpn) . (21) 
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The correlation functions of the shear components can be expressed as (Kaiser 1992) 

(7ii7ji> = \ + cos(4^-)] ; (7i27i2) = \ K+fe') - cos(4^)] ; (7^2) = sin(4^-) , (22) 

where we have written £±(ij) = £±(|#j — &j\)> an d <Pij is the polar angle of the difference vector 0; — 8j. From these 
relations, one obtains for the covariance matrix 



Cov(£+,0i;£f,tf 2 ) 



1 



ij ijk 



+ 



WjWjWkWiA^ (ij)A# 2 (kl) (£ + (il)£ + (jk) + cos [4 (ip a - tp jk )]^_(il)^_(jk) 



ijkl 



(23) 



where the function 5($i — $2) is zero if the two separation bins are different, and is 1 if they are the same. The first term 
in (|2^) therefore contributes only to the diagonal terms in the covariance matrix. In the absence of shear correlations, 
the covariance matrix would be diagonal; correlation populates the off-diagonal elements of the covariance tensor. 
To calculate the covariance matrix for the £_ correlation function, we first write 



(eaeji - e i2 e j2 ) cos 4^- + (e^e-,2 + e^i) sin 4^- ; 



inserting this into the definition of the covariance matrix and performing the same step as for the 
one finds 



covariance, 



Cov(£_,z? i; £_,z? 2 ) 



1 



JVp(tfl)JVp(02) 



(24) 



ijk 



+ ^2w i w :j w k wi A^ 1 (ij)A^ 2 (kl)(cos [4(y>y - (pu - tp jk + <pki)] £-(il)(i-(jk) + cos [4(<p y - tp kl )] £,+ (H)£+(jk) 

ijkl 

Finally, the mixed covariance matrix can be calculated in the same manner, yielding 



Cov^+A^A) 



JV p (i?i)Ap(i? a 



2ct £ 2 2J wfw j w k A i}l (ij)A$ 2 (ik) cos [4(^ fc - ip jk )] (jk) 

ijk 



2 ^2 w i w w k wiA$ 1 (ij)A# 2 (kl) cos [4,(tp a - (p kl )} C-(«0C+(i fc ) 

ijkl 



(25) 



5. Averaging over an ensemble of galaxy positions 

Given a model for the shear correlation, the covariances (p3|-p5|) can be calculated using the actual galaxy positions 
and their weight factors; in principle, this procedure is straightforward to apply to a given data set. Alternatively, 
given the geometry of a data field, then by randomly distributing galaxy positions the expected covariance of the shear 
correlation can be determined; this procedure can be used to design and optimize cosmic shear surveys (Kilbinger 
et al., in preparation). These calculations are, however, time-consuming, given the sum over three and four galaxy 
positions. It is therefore of interest to consider a relatively simple situation where fairly explicit expressions for the 
covariance matrices can be obtained. In particular, we shall calculate the ensemble average of the covariance matrices. 
We consider here a survey geometry which consists of a single data field of solid angle A and galaxy number density 
n, so that N = nA is the total number of galaxies in the survey. The survey geometry is assumed to be 'simple', 
i.e. consisting of a simply connected region, say, a quadratic field. We shall assume that all weight factors are unity, 
Wi — 1. Furthermore, we shall consider separations di which are small compared to the 'diameter' of the survey field, 
<C A; in this case, we can neglect 'boundary effects' which otherwise would complicate the analysis tremendously. 
With these assumptions, the number of pairs in the bin characterized by •& is then 



N p ($) = An2ir$A$n . 

The ensemble average over galaxy positions is carried out by the averaging operator 

E=n(Vd^ 



(26) 



i=l 
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i.e. the galaxies are assumed to be randomly placed on the field. The average of the first term in (|23|) and (|24|) is 
simple and reads 

E ( ? a * <«>) " # - *> = : - *> ■ (27) 

As expected, this term depends explicitly on the bin size chosen, since A^ p cx Az9. In practical units, 

D - 3.979 x 10- ( " ,) " (*) " " . 

V0.3/ Vldeg 2 / V30arcmin- 2 y V larcmin / V 0.1 / 

As we shall see, the other terms are independent of the choice of the bins. To evaluate the other terms, one notes that 
the expectation value of all terms involving a sum over three galaxy positions can be written in terms of 

E a bc = E f ^ (ij) A# 2 (ik) F a (cpij) F b (ipik) F c (6k - 0j) J , (28) 

or linear combinations thereof. Applying the averaging operator, we note that there are N(N — 1){N — 2) i=s A^ 3 
permutations of galaxies, so that 

r3 



Eabc = ^3 / d 2 d 1 J d 2 d 2 J d 2 e 3 A §1 (12)A^(13)F a {^ 12 )F b (^ 13 )F c (e 3 -e 2 ) 

= -IJ d<Ai0i / dv?iF a (^i) / d02</>2 / dip 2 F b (ip 2 ) F c (<p 2 - X ) 

^ Ji?i-Ai?/2 JO Jtf 2 -Atf/2 JO 



'i?i-Ai?/2 JO J)?2-Ai9/2 

where in the second step we have written 02 = $i + </>i, $3 = $i + 02 j so that 6 3 — 2 = cf) 2 — <fi 11 fi and </?2 denote 
the polar angles of 4> 1 , <fi 2 , respectively. After this substitution, the integral becomes independent of 0±, which can be 
integrated to yield a factor A. Assuming that the bin width Az9 is small, one can evaluate the <p\ and <f> 2 integrals, to 
obtain 

Eabc 1 , jp f \ A T-i / , „ / l?2COSI/) 2 -l?l COS (fl 



Np(m) - w*. u ^ FM L ^ F < I - * *z ) ■ (29) 

Similarily, the expectation value of all terms involving a sum over four galaxy positions can be written in terms of 
Eabcd = E \Y,^dij)^2{kl)F a (e k -e j )F b (ei-e i )F c (<p ij )F d (ip kl ) 





ijkl 














A 3 " 





/•#i+Ai)/2 p2tt <.i9 2 +Ai9/2 p 2 tt 

/ d^i^i / 6sp 1 F c ((pi) &4> 2 <j) 2 / d^2F«j(^a)F o (0- <t> x ) F b {cp + <p 2 ) , 

JO J)S 2 -Ai9/2 JO 

where in the second step we defined 02 = $i + 4>\, $4 = #3 + 4> 2 ] then, the arguments of the functions F a and F 
become 6 3 — 9 2 = 6 3 — Q\ — <p 1 and 84 — Q\ = 9 3 — B\ + <p 2 , i-e. they depend only on the difference cp = O3 — d\, 
so that the integral over 9 3 can be carried out, yielding a factor A. Performing the 4>i and 4> 2 integration, assuming 
small bin width, one obtains 



EnhrA 1 



p27T /"27T /*27r 

.0/ d Vl F c (^) d^ 2 F d (<p 2 ) dipF a (iP a )F b (iP b ) , (30) 



iVp(tfi)A/p(i?2) (27r)My 
where the vectors 

_ / </>COS(y9 - 1?! COS<,9i \ . _ / COS V3 + 7j 2 COS 932 \ /o-,^ 

y (/isin^ — $1 sin 931 J 1 b \ (f) sin tp + d 2 sin ip 2 J 

have been defined for later convenience. 

Next we shall evaluate the second term of ( p3| ) which is of the form (p8[), with F a = 1 = F b , F c (xp) = ^ + (|i/j|); 
inserting these expressions into (^9|) , the expectation value of the second term in ( |23| ) becomes 

q++ = ^J Q d ^ ^+ ^2-2^1^2 cos^) . (32) 
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The expectation value of the third term in ( p3| ) can be calculated by splitting it up into three parts: in the first, 
F a {ip) = = F b (xp), and the other two are obtained by expanding the cosine-term, so that for one of them, 

F a (ip) = cosAp^ = F b (ip), and for the other, cos is replaced by sin; here, ip^ is the polar angle of ip. Note 
that all three terms have F c = 1 = Fj. The final result then reads 



E 



(Cov(| +) tf i; | + ,tf 2 )) =D5(# 1 -# 2 )+q ++ +r +0 +r +1 , 
where D and q++ have been defined in ( p7j ) and (|3^), respectively, 



(33) 



r+o = 



r+i 



ttA 



(2n)A 



(/) dipx^-QijiJ) dip 2 £-(\fPb\) [cos4ip a cosAipt + sinAip a sm4ip b ] , 



(34) 



and (p a , (p b are the polar angles of ip a , tp b , respectively, cos4<,5 a = 1 - Sipl 1 ipl 2 /\ip a \ i , sin4<p a = iip a itp a 2 C0oi - 
V'i^VIV'al'S an d the analogous expressions for tfb- Note that the <p- integration present in ( pO[ ) has dropped out as the 
integrand depends only on <p\ — (p, and <p 2 — <P', hence, the ^-integration can be carried out and one can use (f = in 

Several issues are worth mentioning: (1) only the first term containing the 'delta function' depends on the bin 
width A$; thus, the bin width only affects the autovariance. (2) All terms are proportional to A -1 ; hence, the relative 
contribution of the terms is independent of the survey area, at least for separations well below the 'diameter' of the 
survey area for which the foregoing procedure of the ensemble averaging is valid. (3) The terms denoted by V are 
independent of the intrinsic cllipticity dispersion and of the number density of galaxies. Hence, these terms describe 
the cosmic variance and thus provide a limit on the accuracy of the determination of the correlation function for a 
given survey geometry, independent of the observing conditions which determine n. 

The expectation values of the other covariance matrices are calculated in a similar manner. Consider the ' ' 

covariance ( p4| ) next; the first term agrees with that of (p3|). For the second term, we can apply (|29| ), after expanding 
the cosine; then F a (ip) and Fb(p) are either cos4<y9 or sin4<^>, and F c (ip) = £ + (|i/>|). Similarily, (|3Cl|) can be applied to 
the third term of (|24|), after expanding the cosine; using ( |30| ) term by term, and combining the results afterwards, one 
obtains 



E (Cov(£_ 40) =D 8(#i -$ 2 ) + q 

where 

?.rr 2 r 

dip £+ 



+ r_o + r_ 



(35) 



q — 



r-o 



r-i 



■nAn 
1 

(2n)A 
1 



2#i$2Cos<£> cos(4</?) , 



271 



2 ix 



(f> / dipi£-(\i]j a \) / dtp 2 €-(\il>b\) cos[4(9?i + ip 2 - <fia - <Pb)] 



(36) 



271 



2tt 



4> d^i^+GVal) / dp 2 t, + {\ip b \) cos[4(^! -(pa)] 



(2ir)A 

Finally, the expectation value of (Pq) , the mixed covariance, is calculated in a similar manner, yielding 
E(Cov(|+,tf i; |_,tf 2 )) =g + _+r+_ , 



(37) 



with 



Q+- = 



it An 

1 

ttA 



dip 



' 4 

£ 

Lfc=0 

2tt 



(— l) fe cos(M 



2ti 1 ti 2 cos ip) 2 £_ 



/ d^+GVJ) / d^ 2 £-(IV>&l) cos[4(v? 2 - <p b )] . 



(38) 



Hence, as already seen from (^5|), the cross-covariance matrix has no pure noise term from the intrinsic galaxy ellipticity 
dispersion as the other two covariance matrices. 

We have obtained numerical estimates for the ensemble-averaged covariance matrices derived above (see Fig.Q). 
In the numerical estimates given in this paper (except Sect. 8.2), we have used a standard set of parameters. The 
cosmological parameters are those of a by-now standard A-dominated universe, f2 m = 0.3, Q\ = 0.7. The power 
spectrum of the density fluctuations is described by its primordial slope of n — 1, a shape parameter T = 0.21, 



Peter Schneider et al.: Analysis of two-point statistics of cosmic shear: I. Estimators and covariances 



9 




Fig. 1. The correlation functions. In the left panel, we have plotted the covariance matrices Cov' ++ ( , di, ^2) (thick solid 

curves) and Cov' (i?i,i?2)j i-e. the covariance matrices with the shot-noise term removed. For Gov , $2), the 

contours are linearly spaced, with the lowest value at 10 -9 (outer-most contour) and highest value 9 x 10~ 9 for small 

i?2- For Cov' $2), contours are logarithmically spaced, with consecutive contours differing by a factor 1.5. The 

solid contours display positive values of Cov' (i?i,i?2), starting from 10~ 14 , with the maximum value of ~ 3 x 10~ 9 

in the upper right corner, and dotted contours show negative values of Cov! #2)1 starting at — 10~ 15 . In the right 

panel, Cov_| (i?i,i?2) is shown, again with logarithmically spaced contours differing by a factor of 1.5. Solid contours 

are for positive values of Cov+_(i?i,i?2), starting at 10~ 14 , negative values are shown by dotted contours, starting at 

-10- 13 . 



and a normalization of cr% = 1. We used the fit formula of Bardeen et al. (1986) for the linear power spectrum, and 
the prescription of Hamilton et al. (1991) in the form given in Peacock & Dodds (1996) to describe the non-linear 
evolution of the power spectrum. Furthermore, we fix the survey properties to be described by a fiducial area of 
A — ldeg 2 , a number density n = 30arcmin~ 2 of source galaxies, and an intrinsic ellipticity dispersion of a t — 0.3. 
The source galaxies were assumed to have a redshift distribution p(z) oc z 2 exp[— (z/zq) ], so that the mean redshift 
is z — 1.505 zq. For the examples shown in Sects. 5 through 7, we take zq = 1. Note that all covariances simply scale 
with A^ 1 , so that the results displayed here are easily translated to other survey sizes. This scaling is also implicitly 
implied when considering scales of order a degree or more - all the numerical estimates are for the ensemble averaged 
covariances, and their validity as given here depends on the assumption 9 2 <C A. 

Figure^ displays in the left panel the square root of the (auto)variance of and with Var(£±;t9) = 

Cov(£±, £±, §). To calculate the value of D ( p7j ) which enters the diagonal part of the covariance matrix, we have 
assumed a bin width of A?9 = 0.1^. The square root of the variance - or noise per bin - for £ + is smaller than £ + for 
•d 10', and larger for larger angles (for the assumed value of A = 1 deg 2 ), whereas the noise for £_ is smaller than 

in an interval of 1' $ 30'. The determination of £_ on small angular scales is much more difficult than for £ + , 
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t5[arcmin] i? 2 [arcmin] 



Fig. 2. Left panel: The square root of the variances y Var(£ + ;i9) and y Var(£_;i9) shown as dotted and long-dashed 
curves, together with the correlation functions £+($) and £-($) as solid and short-dashed curves, respectively. The 
model parameters are as described in the text; in particular, a fiducial value of the survey area of 1 deg 2 has been 
taken. For the diagonal part of the covariance matrix, we have assumed a relative bin size of Az9/?9 = 0.1. For small 
the variance behaves as i? , as it is dominated by the noise from the intrinsic ellipticity of the source galaxies, i.e. 
the term D (^?]), whereas for larger values of the main contribution comes from cosmic variance. Right panel: The 
correlation coefficient r corr , as defined in (fj^), as a function of $2, for various values of Solid curves show r corr (^+), 
dashed curves show r corr (£_). The value of $1 corresponding to each curve can be read off from the point where a 
curve attains the value r corr = 1. 



owing to the smallness of for small 1?. Note that the noise scales like A -1 / 2 , so that from a survey of 16sq.deg., 
like the DESCART survey (see van Waerbeke et al. 2001) one should be able to obtain reliable measurements of £+ 
for d 1°, and of £_ for 10" ^5 ■& ^5 2° in bins of relative width of 0.1. Of course, the covariance of the shear will not 
only depend on the survey area, but also on its geometry; one might therefore design survey geometries which yield 
the desired noise behaviour as a function of angular scale (see, e.g., Kaiser 1998). 

In order to show how strongly the correlation estimators at two angular scales are correlated, we define the 
correlation coefficient 

r C orr(£±;Pi,#2) := — p =, (39) 



Cov' (£± , 0i ; £± , 0i) Cov' (£± ,d 2 ; £± , 2 ) 

which is unity for §1 = $ 2 . Here, the prime indicates that the first term in the covariances [the one proportional to 
5(t?i — $2)] has been subtracted off, in order to show the correlation induced by cosmic shear. The correlation coefficient 
is plotted in the right panel of Fig.|^. The £_ correlation function decorrelates quickly: once the ratio between the 
angular scales is larger than ~ 2, the correlation coefficient ( |39| ) has decreased to less than 10%. In contrast to this, the 
correlation function is correlated over much larger angular scales. This was expected, given that the filter function 
which relates the correlation function to the power spectrum is much broader for £+ than for . 
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6. Covariance of the aperture mass estimator 

Due to the particular interest in the aperture mass dispersion, we shall consider here the covariance matrix of the 
estimator (TLA), 



Cov(M;0i,02) - (M(9 1 )M(9 2 )) - (M* p (0i)) <Af a 2 p (# 2 )) . 

Inserting (|l6|) into this expression, the covariance matrix of A4 becomes 



(40) 



Cov(.M; 



■K+(l-K+) 




where we used the abbreviated notation C ++ ($i, i? 2 ) = Cov(£ + , §i] £ + , z? 2 ) etc.. The values of the aperture radius are 
taken to be 9k = mk Az? for fc = 1,2, and §i = (i — 1/2) A^, as before. Hence, the covariance of the aperture mass 
dispersion can be obtained directly in terms of the covariances of the shear correlation functions. We can also obtain 

the ensemble-averaged covariance matrix; for this purpose, we separate the 'delta-function' term in C++ and C 

from the rest and thus write C++(i?i, ^2) = D S(i3i — ^2) + C ++ (z?i, $2), similarily for C Then the ensemble-average 

of (|4l| ) becomes 



E(Cov(M; 0i,0 a )) = 
r20x Mi §1 



2nAn 2 



2miii(0i,0 2 



9\9\ 



KlT + 



T 4 



() 



(1 - K+fT- 







K+{l-K, 



T + 



0i 



C+_(tfi,i? 2 )+T_ 



14 



C+_(l?2,1?l) 



(42) 



As expected, the covariance matrix of M. does not depend on the binning of the correlation function. For the calculation 
of M. for observational data, it is therefore best to choose very small bin widths, in order to minimize discretization 
errors. 

The first term in ([l2|) yields the covariance in the absence of cosmic shear correlations, i.e. the covariance of the 
estimator M due to the intrinsic ellipticity of the source galaxies. This term can be written as 



2nAn 2 9j 



/(0i/0 2 ) = 3.939 x l(T 10 (^y 



.4 



ldeg' 



30 arcmin 



1 arcmin 



/(0i/0 2 ) , 



where 



f(R) = (1-2K+ + 2KD [ dxxT + (x)T + (Rx) = (1-2K+ + 2KD [ dxxT-(x)T 

Jo Jo 



'-(Rx) 



= (1 - 2K + + 2K 2 + )(576) 2 j ^ J»(Jtt) J; 



(*) 



(43) 



where the explicit relation has been obtained by making use of the original definitions of T±. The function f(R) is 
maximized at R = 1, where its value is f(l) 0.29, and it decreases quickly for appreciable ratios of the aperture 
radii. The dependence on K+ is simple, and obviously this noise term is minimized for K+ = 1/2, i.e. when both 
correlation functions enter the estimate M. with equal weight. The variance of the estimator M. in the absence of shear 
correlations can be compared with the corresponding expression obtained in SvWJK, obtained from their equations 
(5.12) and (5.16), yielding 



2ir 2 n 2 9 4 N { ir 2 An 2 9 2 ' 

where G = 1.2 is a numerical coefficient (calculated from the filter function in the definition of M ap ), and N{ is the 
number of independent apertures that can be placed on the data field, taken to be Aj (49 2 ). Together with /(l) s» 0.29, 
one clearly sees that the variance of the estimator M. is substantially smaller than that used in SvWJK which was based 
on placing non-overlapping apertures on the data field. The main reason for this difference is that in the latter method, 
only one component of the source ellipticities is used; furthermore, apertures separated by less than their diameter 
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Fig. 3. Left panel: The square-root of the autovariance of as a function of angular scale. The long-dashed and 
dash-dotted curves show the minimum variance, i.e. in the absence of a shear correlation; this variance is due solely 
to the intrinsic ellipticity of source galaxies. Curves are shown for K + = 0,1/2,1, where the minimum variance is 
the same for K + = and 1. The solid, dashed and dotted curves show the variance in the presence of a cosmic 
shear; also here, the cases K + = and K- = 1 are nearly the same, and the variance is smallest for K + = 1/2. 
For comparison, (M^ p ) (9) is plotted as thick solid curve. As for the other figures shown before, our standard set of 
parameters A — ldeg 2 , n = 30 arcmin -2 and cr e = 0.3 has been used; the variance scales as A" 1 . Right panel: The 
correlation coefficient r corr (A4; 6i, 82) of the covariance of the estimator M. is plotted as a function of #2, for various 
values of 9\\ the values of 9\ can be localized as those points where the correlation function attains the value unity. 
The solid curves are for K + = 0, i.e. when only the correlation function £_ is used in the estimate of A4, the dotted 
curves are for K + = 1/2, and the dashed curves for K + = 1. The width of all three families of curves is very similar 
and (in logarithmic terms) basically independent of 9\ . The K + = curves do not develop a tail of anticorrelation, as 
is the case for K + = 1 (and therefore also for K + = 1/2). Hence, whereas K + = 1/2 yields the smallest variance of 
the estimator Ai, it leads to a small but long-range correlation between different angular scales 

are statistically nearly independent. Hence, the determination of (M f p ) through the shear correlation function is not 
only more practical in the presence of gaps in the data field, but also far more efficient than the alternative method. 

We have plotted in Fig.|^ the square root of the variance Var(.M,#) = Cov(./Vf;#, 9) of M, both for the case of 
no correlations, and for our standard model for the cosmic shear. For small angular scales, the variance is completely 
dominated by the intrinsic source ellipticities, whereas the cosmic variance is the important noise error for larger 
angular scales. For the parameters used here, the transition between these two regimes occurs at a few arcminutes. 
It must be noted that the shape of the variance curves are independent of the survey area A. The results shown in 
Fig. ^] can be seen as an a posteriori justification of using the Gaussian assumption for the calculation of the shear 
four-point function in Sect. 4. In Fig. 4 of van Waerbeke et al. (2002), the influence of the non-linear density evolution 
on the kurtosis of M ap was studied, using ray-tracing simulations through numerically generated cosmological matter 
distributions. The non-Gaussian effects start to become non-negligible for angular scales below ~ 5'. As can be seen 
in Fig.H, this is about the scale where the transition occurs between the variance being dominated by the intrinsic 
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cllipticity distribution and the cosmic variance. Hence, we can expect that a more advanced treatment of the shear 
four-point function would yield a slightly larger variance in this transition region around ~ 5': for significantly smaller 
scales, it is dominated by the intrinsic ellipticity noise, and for larger scales, the shear four-point function is basically 
Gaussian. 

The variances of M. for the cases K + = and K + = 1 arc basically identical, and larger by a factor ~ 2 than 
the variance of the estimator for K + = 1/2. Hence, to minimize the variance of the estimator A4, K + = 1/2 should 
be chosen. With this choice, the results are unchanged even in the presence of a B-mode contribution (see SvWM). 
As was already mentioned by C02, using cosmic shear estimators which use £ + and £_ with equal weight reduces 
the resulting noise by a factor 2 -1 / 2 . One notes that the variance for large rises, but very slowly. We can compare 
the behavior of the variance of M. with that derived in SvWJK for a more direct estimator for the aperture mass 
dispersion; using eqs. (5.12) and (5.16) of that paper, one finds in the limit of large angles (and, to make the estimate 
comparable to the one obtained here, zero kurtosis) that y/Va,r(A4'; 9) w 29 (M^p) (0)/A, where A4' is the estimator 
used in SvWJK. The functional behavior with 9 is similar to that seen in Fig. l3l but the amplitude is lower by a factor 
of about 2 for K + = 0, 1, and about 3 for K + = 1/2; this again shows the superiority of the estimator considered here 
in comparison to laying down independent apertures on the data field. 

To investigate the correlation of the estimator M. between different angular scales, we define the correlation 
coefficient 

r co „(M;9ue*) = - 9^hM , (44) 

y/Gov(M;9 1 ,9 1 ) Cov(M;9 2 ,0 2 ) 

which has the property that r COIT (M.; 9,9) = 1. The dependence of this correlation on the ratio of the angular scales 
then provides information on the correlated error of the determination of the aperture mass dispersion on different 
angular scales. In the right panel of Fig. |] we have plotted this correlation coefficient as a function of 2 , for various 
values of 6%. The logarithmic representation clearly shows that this correlation coefficient depends mainly on the 
ratio 9\/9 2 . The correlation drops off quickly, so that estimates of (M^ ) (9) for two angles with ratio 9i/9 2 1/3 
or 0i /0 2 *S 3 are practically uncorrelated. This was to be expected given that (M^ p ) (9) is obtained from the power 
spectrum P K (£) through a very well localized filter function. Hence, the estimator M. decorrelates much faster than 
those of the shear correlation functions. Also seen in Fig. |] is the fact that in the case of K + = 1/2 and K + = 1, the 
correlation coefficient attains long negative, but low-amplitude tails, whereas they are basically absent if K + = 0. This 
is due to the much faster decorrelation of £_ with scale ratio than that of . 

7. A simple estimator for the power spectrum, and its covariance 

The relations @ may be used to define an estimator for the power spectrum P K (£) as 

P(£) = 2tt / d0 [K+t + (9)J (lff) + (1 - K+)^(0).h(£0)} , (45) 

JOmln 

where K + S [0, 1] again describes the relative contribution from the £ + correlation. Here, # m i n and 9 max describe the 
range over which the correlation function has been measured. If this range extends from zero to infinity, the estimator 
( [45| ) would be unbiased (and would yield the E-mode power spectrum for K + — 1/2 even in the presence of B-modes); 
for real datasets, where this range is finite, ( f45| ) is a biased estimator. Note that in the absence of B-modes, eq. ( |45| ) 
remains valid even if the factor K + is chosen to be a function of £. The expectation value can be obtained by inserting 
the relation (||) between the correlation functions and the true power spectrum into (^5|) to yield 

(P(t)} = P oha (£) = J d£' £' [K + G (£,£') + (1 - K+)G 4 (£J')} P K {£') , (46) 
with 



G n (£,£') = / d6 9J n (£0)J n (£'6) = 



{£'J n+ i(£'0)J n (£9) - a n (£'0)3 n+1 (£0)} 



(47) 



'min 



We have plotted the 'observed' power spectrum as £ 2 P Q bs (£) in Fig. |, for K+ = and K+ = I, assuming that 
= 6" and T[mx — 2°. A comparison with the underlying power spectrum (shown as heavy solid curve) shows 
that P bs traces the true power spectrum over a wide range of ^-values, though in an oscillatory way. If P bs is 
determined solely from £_ (i.e. K + = 0), it substantially underestimates the power for I ^ 10 2 (that is, approximately 
for £ 27r/6' max ), but traces the true power spectrum out to the largest values of £ plotted. Conversely, the observed 
power determined from £ + yields good estimates of the true power for small values of £, but deviates from it strongly 
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Fig. 4. The thick solid line displays the dimensionless projected power spectrum £V K {i), whereas the other two 
curves show the 'observed' power spectrum, as defined in (46). The dotted curve is for K+ = 0, i.e. only enters 
the determination of the observed power spectrum in this case; the dashed curve is for K+ = 1 . In this plot is was 
assumed that the correlation functions are known between m j n = 6" and 9 m in = 2° 



for £ <; 8 x 10 3 , that is for values of £ much less than 27r/# m i n ~ 2 x 10 5 . The different behavior of the two estimates 
again is due to the different filter function through which correlation function and power spectrum are related. Fig.|J 
suggests that the best estimate for the power spectrum is obtained by choosing K+ = 1 for small values of £, and 
K+ = for large £. 

The covariance matrix of P reads, for K+ = 1, 



Cov(P;£,£') = (2tt) 2 



dee j (£9) 



&6' 6' J (£'e') C++ (9,9') 



(48) 



the generalization for other values of K+ is obvious and shall not be reproduced here. 

To estimate the power spectrum from cosmic shear data, it is useful to define the power in a band with upper and 
lower £- values £i U and £n as 



Vi 



1 

a" 



d££P(£) 



i Jti 



2n 



d9 



{k+£+(9) [g+(l m 9) - g+(£iid)] + (1 - K+)£-(0) [<?-(4>0) - 9-(^)} } ,(49) 



where Aj = ln(£ iu /^i) is the logarithmic width of the band, and 



Ji(x) - 8J 2 (a;) 



g+(x) = xJi(x) ; g-(x) = x 



(50) 



One expects that the band power traces £^ P K (£i), where £i is the geometric mean of £i U and £n, i.e. the center of the 
bin. The covariance of the band power of two bins i and j is 



Cov(ViVj) = 



2nat 



A t A j A n 2 



d9 



K 2 + G % +(9)G 3 +{9) + (1 - K+) 2 G^(9)G 3 -(9) 



A, A 7 



d9 [ Bn ^ d9' 



T {K 2 + C%(9,9')G l+ (9)G 3 +(9>) + (1 - K+) 2 CL_(9,9')GU8)G 3 -(0') 



+ K+{l-K+)C+-{9,9') {G t +(9)G 3 -(9') + G t -(9')G 3 +(9)}} 



(51) 



where Gi±(9) — g±(£i n 9) — g±(£i\9). In (|TJ), we have already split off the 'delta-function' part of the correlation 
covariance matrices, which yields the first term. It should be noted that the foregoing expressions remain valid if K+ 
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Fig. 5. The large panel shows the estimates of the band power, shown as horizontal bars whose length indicates the 
bins used. The error bar on each bin shows the square root of the autovariance of the band power, and the solid 
curve is the underlying power spectrum, £ 2 P K (£). For this figure, we have assumed that the correlation functions are 
measured for 6" < d < 2°, from a survey of A = 25deg 2 . The inset fi gure shows the correlation coefficient between 
the 13 different bins, where the triangles indicate the center £ of each bin. One sees that the bands are very little 
correlated, except for the three bins with smallest £; in fact, the first three band power estimates are fully correlated. 
This explains why the band-power estimator yields reasonable results even for £ < 27r/# max ~ 180 - this is just a 
coincidence. 

is varied as a function of £. From Fig.|| it is clear that in order to get the least bias, one wants to choose K + ~ 1 for 
small £, and K + ~ for large £; for the intermediate region, setting K + — 1/2 should yield the smallest error on the 
power spectrum. We have therefore constructed a function K + (£) which has these desired properties. 

In Fig.^ we have plotted the band power for our reference model parameters, in 13 bins of width £i U /£a = 2, 
between £ = 10 and £ w 8 x 10 4 . The band power is shown as crosses, and vertical error bars show the range of the 
bins. For comparison, the solid curve shows £ 2 P K (£); as expected, with this new choice of K + (£), the band power 
traces the underlying power spectrum over a very wide range of wavenumbers. Only in the bins with the smallest and 
largest value of £ is there a significant bias; over the range 27r/6' max w 180 £ 27r/0 ro j n «2x 10 5 , the band power 
estimator is practically unbiased. Next we calculated the error bars on the band power, by taking the square root of 
the diagonal part of (|5l|). For this calculation, we have assumed to have a total area of A — 25deg 2 , for which the 
condition #, 2 nax <C A for the validity of the treatment of the ensemble average in Sect. 5 is approximately satisfied. The 
square root of this autovariance is plotted as errorbars on the band power in Fig.^; as can be seen from this figure, the 
signal-to-noise ratio is larger than unity in all bins shown, and in fact very large for intermediate values of £. Hence, 
the power spectrum P K {£) can be measured over a broad range of £ for the parameters chosen here. 

Of course, in order to interpret the error bars correctly, it is important to see the degree of correlated noise between 
different bands. The correlation matrix for the bins [defined in full analogy to (|44|)] was calculated and its values are 
plotted in the inset of Fig.|5|. One sees that errors of the bins for intermediate and high values of £ are essentially 
uncorrelated (the correlation coefficient for neighboring bins is 10% for £ ^ 200); however, for £ 100 the bins 
become strongly correlated. In fact, the agreement of the band powers with the underlying power spectrum is forticious 
for £ ^ 100: the three band powers at lowest £ are nearly fully correlated, so that these three points contain practically 
the same information of the underlying power spectrum. 

The method presented here for the determination of the power spectrum has the virtue of its simplicity. Other 
methods for determining the power spectrum from shear data have been investigated, e.g. by Kaiser (1998), Seljak 
(1998) and Hu & White (2001). Our approach has the property that it makes use only of the shear correlation 
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Fig. 6. 1-er, 2-cr and 3-ct confidence contours in the ri m -Eg-plane, where Ss is a scaled version of the power spectrum 
normalization parameter as, as indicated. Dotted, dashed and solid contours correspond to x+i X- an d Xtot- ^ n the 
left panel, the shape parameter of the power spectrum T = 0.21, whereas in the right panel, T = 0.7f2 m . The reference 
model is the one used before, i.e. f£ m = 0.3, fl\ = 1 — r2 m , as — 1- The contours are obtained by assuming a survey 
area of A — 5 deg 2 , and that the correlation functions were measured in the range 12" < ■& < 30'. 

functions, not on the spatial distribution of the shear. Since the shear correlation function contains all two-point 
statistical information of the shear field, no information loss occurs. Comparing the results of Fig.[| with those of 
Hu & White (2001) it seems that both methods yield very similar error bars of the power spectrum, and that in the 
respective f-range of applicability, the decorrelation between neighboring bins is equally quick. Since our method does 
not require the 'pixelization' of shear data, it can estimate the power spectrum to larger values of £. 

8. Constraints on cosmological parameters 

One of the central goals of cosmic shear research is the determination of cosmological parameters. Since the power 
spectrum P K , and thus the shear correlation functions, depend on the cosmological model, precise measurements of 
cosmic shear can be used to tie down the range of allowable model parameters (e.g., Jain & Seljak 1997; Bernardeau 
et al. 1997). The largest cosmic shear survey today already yielded significant model constraints (van Waerbeke et al. 
2001, 2002; Hoekstra et al. 2002). We shall briefly discuss the expected confidence regions in parameter space, using 
the previously calculated covariance matrix of the shear correlation functions. For that, we consider a figure-of-merit 
function 

x 2 (p) ■■= E few - tf) Cov ^ few - 0) > ( 52 ) 

where the index * indicates the correlation function of the input model, p is a set of model parameters, and the 
summation indices label the angular bins of the correlation function. We shall consider three different kinds of the 
function ([52]): in the first, named x\ hereafter, the correlation functions in (|52|) are the £+ correlations, and the 
covariance matrix corresponds to (|23|). The second kind is denoted by \ 2 ~ and uses the £_ correlation function. The 
third kind, Xtot' ^ s obtained by constructing a vector £j = (f+i, £+2, ■ ■ • , £,+n , £-1, £-2, ■ ■ • ,£-at), when there are N 
angular bins in which £± has been measured. Correspondingly, the covariance matrix in (p2]) is the 2 x 2-block matrix 
with C++ and C__ in the upper left and lower right quadrant, respectively, and C_| in the other two. 
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8.1. The ag-£l m degeneracy 

We first have calculated x 2 for a two-parameter set of cosmological models, by varying f2 m and er 8 ; these two parameters 
were considered before in van Waerbeke et al. (2001). We kept the other cosmological parameters fixed, except that flat 
Universes are considered, i.e. f2 m +r2A = 1. From the confidence contours plotted in van Waerbeke et al., it is anticipated 
that the % 2 -function, considered as a function of Q m and erg, will have a long and curved valley close to its minimum. 
This was verified here. In it therefore useful to consider the combination Eg := er 8 [0.41 + 0.59(f2 m /0.3) _068 ] as 
parameter when plotting contours of \ 2 (the numerical values occurring in this definition have been obtained by a fit 
through the valley line in the f2 m -<7g plane). 

In Fig. U we have plotted contours of constant x 2 i n the f2 m -Eg plane, corresponding to 1-er, 2-cr and 3-cr confidence 
regions. In the left panel, we kept the shape parameter of the power spectrum fixed, T — 0.21, whereas in the right 
panel we used T = 0.7f2 m , adequate for a (dimensionless) Hubble constant of h = 0.7. Shown are the confidence 
regions for all three functions x 2 ; as indicated. The first point to note is that, for a given value of il m , erg is very 
well constrained, to within a few percent. This implies that the normalization of the power spectrum is very well 
determined from cosmic shear observations. Secondly, the 1-er uncertainty on f2 m is about 0.1 for an assumed survey 
size of A — 5 deg 2 ; indeed, the left panel of Fig. ^| can be compared directly with similar figures in van Waerbeke et al., 
and the constraints on f2 m are quite similar. Third, if the shape parameter changes with f2 m , the confidence regions 
are narrower than when setting T = 0.21 as constant, which implies that the shear correlation functions are sensitive 
measures for T. 

8.2. Additional parameters 

In the prospect of measuring the cosmological parameters, we must also allow for our lack of knowledge of some 
parameters which might have a significant effect on our observable, but which are poorly constrained. In particular, it 
is important to explore the possible directions of degeneracies and to identify which combination of parameters offers 
the most promising measurements. As shown in Jain & Seljak (1997), the shear 2-point statistics depend primarily on 
four parameters: 17 m , er s , T and z s , the source redshift parameter. We now model the source redshift distribution with 

p( z > = /, ; \ — exp 

where we choose a = 2 and (3 = 1.2, and take z s as the free parameter; note that the mean redshift of the sources is 
(z) w 2.09z s . This choice is motivated from observational considerations (see van Waerbeke et al. 2002). We therefore 
constructed the 4-dimensional figure-of-merit function (|52|), where now £j = £ + (#;; il m , erg, T, z s ). The fiducial model 
is chosen to be fi m = 0.3, erg = 1, T = 0.21 and z s — 0.44. It is assumed that the shear correlation function is 
measured in the range 0'.6 < i? < 30', comparable with the most recent cosmic shear measurements. We also fix the 
survey area to A = 16 deg 2 and the galaxy intrinsic cllipticity r.m.s. to 0.3. Figure |?] shows the confidence regions, 
assuming weak priors on pairs of hidden parameters. From the four parameters to be determined, we can construct 6 
pairs of parameters, and for each pair, the other two parameters are marginalized, in such a way that O m € [0.2, 0.4], 
erg G [0.8, 1.1], r € [0.1,0.3] and z s £ [0.4,0.5]. The marginalization intervals are chosen to be consistent with realistic 
constraints coming from other experiments, especially from the Cosmic Microwave Background (e.g., Sievers et al. 
2002), the 2dF (Lahav et al. 2001) and the SLOAN (Szalay et al. 2001) results, and photometric redshifts. Note 
that we always fix the cosmological constant to Q\ = 1 — Q m , that is we assume the (correct) flat geometry. The 
strongest constraints are found for the n m -crg, r-crg and f2 m -r pairs, suggesting that once the redshift distribution 
of the sources is known we can obtain stringent constraints on the cosmological parameters from cosmic shear. The 
degeneracy directions can be better studied with a strong prior likelihood analysis as shown on Fig.|^. On this plot, we 
assume that the two hidden parameters of each constrained pair are known and fixed at their true value. Again we see 
that the best constraints come from pairs of parameters which exclude the redshift information. It is remarkable that 
the degeneracy among these parameters can be broken quite efficiently, given that the cosmic shear signal has so few 
"spectral features", compared to the cosmic microwave background for instance. In fact, the degeneracy can be broken 
when using linear and non-linear scales simultaneously (as discussed in Jain & Seljak 1997). One would obtain an even 
better degeneracy breaking by going further into the linear regime (up to 1 or 2 degrees). Also, it is interesting to 
note that the parameters which are least affected by the marginalization (compare Figures |^ and|^) is the f2 m -er 8 pair, 
which is most directly comparable to cluster normalization constraints (see a discussion on the comparison between 
cluster normalization and cosmic shear constraints in van Waerbeke et al. 2002). 
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Fig. 7. l-cr, 2-<7 and 3-cr confidence contours for the maximum likelihood analysis on the four parameters f2 m , as, T 
and the source redshift parameter z s (see text). The six possible pairs of parameters are displayed. On each figure, the 
two hidden parameters are marginalized such that f2 m 6 [0.2,0.4], cr§ 6 [0.8, 1.1], T G [0.1,0.3] and z s £ [0.4,0.5], and 
the cosmological constant is fixed to ft\ = 1 — f2 m . The reference model is fi m = 0.3, ug = 1, T = 0.21 and z s = 0.44. 
The survey area is A = 16deg 2 , the galaxy ellipticity r.m.s. is 0.3, and the correlation functions are measured in the 
range C.6 < < 30'. 



9. Conclusions 

In this paper we have obtained general expressions for the covariance of an estimator for the shear correlation function 
as it is determined from cosmic shear data. Using the approximation that the four-point function of the shear separates 
in products of two-point function, the covariance can be expressed directly in terms of the correlation functions, as 
given in (p3|-^5|) and can, for a given data set, be calculated directly. The covariance of the correlation functions 
depends on the number of pairs that enter their estimate, which in turn depends on the solid angle covered by the 
survey and the survey geometry (see also Kaiser 1998); in addition, it depends on the intrinsic galaxy ellipticities and 
the number density of galaxies. 

Next, considering a survey geometry of a single compact region of solid angle A, we have calculated the ensemble 
average of the covariances, using approximations which a valid for separations <C \J~A. The ensemble average of the 
covariances can then be reduced to integrals which are readily evaluated numerically. The estimate for the correlation 
function £-($) decorrelates quickly, i.e. estimates of £_ for two angular scales which differ by more than a factor ~ 2 
are essentially decorrelated. On the other hand, the estimates of are correlated over much larger angular scales. The 
cross-correlation between ^_|_ ) and £-(^2) is significant for d\ Si ^2, which is due to the properties of the different 
filters with which these correlation functions are related to the power spectrum P K {(). 
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Fig. 8. Same as figure with strong priors: in each figure, the two hidden parameters as assumed to be known perfectly. 
These plots show the degeneracy directions among all the possible pairs of parameters obtained from f2 m , as, T and 
z$. Note that the wiggles at the edge of the contours are not real features of the probability constraints. Their are 
inherent to the sampling limitation of the 4-dimensional cube of models given the memory limit of our machines. Also 
note that the upper left panel is the analogue of Fig.O, but without the scaling employed there. 



Using these ensemble-averaged covariances for the correlation functions, we have obtained the covariances for other 
two-point measures of the cosmic shear, primarily the aperture mass dispersion and the power spectrum. Of particular 
interest is the reconstruction of the power spectrum P K (£) from the correlation functions; we have constructed a 
simple estimator for P K and the band powers of it in terms of the £'s and found that the band power can be obtained 
with surprisingly large accuracy from even a moderately-sized cosmic shear survey. Finally, we have investigated the 
confidence regions for the most relevant cosmological parameters (f2 m , a&, T and z s ) with a maximum likelihood 
approach. We studied our ability to constrain simultaneously these parameters from a measurement of the shear 
correlation function, as well as the effect of some level of lack of knowledge using the marginalization technique. 

In a future paper, we shall investigate strategies for conducting cosmic shear surveys by optimizing the survey 
geometry. 
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